Building an open-source development infrastructure for language technology projects

Authors

  • Sjur N. Moshagen
  • Tommi A. Pirinen
  • Trond Trosterud
Abstract

The article presents the Giellatekno & Divvun language technology resources, more specifically the effort to utilise open-source tools to improve the build infrastructure, and the solutions to help adapt to best practices for software development. The article especially discusses how the infrastructure has been remade to cope with an increasing number of languages without incurring extra overhead for the maintainers, and at the same time let the linguists concentrate on the linguistic work. Finally, the article discusses how a uniform infrastructure like the one presented can be used to easily compare languages in terms of morphological or computational complexity, coverage or for cross-lingual applications.


Similar resources

The Role of Public Private Partnership (PPP) in Building Society

Over the last two decades, PPPs have risen rapidly across the world. Governments in developing countries like India are using PPP arrangements for improved delivery of infrastructure services and social services. Public Private Partnerships, which are an integral part of the new paradigm of good governance policy, are the most recent addition in the world of society and ...


The Archway Project: Architecture

An unusual alliance called the ARCHway Project is developing an Edition Production Technology (EPT), a technological infrastructure for collaborative research, teaching, and learning between computer scientists and specialists in Old English. Our goal is to identify and solve problems of mutual importance in building image-based electronic editions of significant cultural materials. The EPT wil...


Modularisation of Finnish Finite-State Language Description - Towards Wide Collaboration in Open Source Development of a Morphological Analyser

In this paper we present an open source implementation of a Finnish morphological parser. We briefly evaluate it against contemporary criticism of monolithic and unmaintainable finite-state language descriptions. We use it to demonstrate a way of writing a finite-state language description that is used for a varying set of projects that typically need a morphological analyser, such as POS tagging, ...


Paving the Bare Spots Towards an Enterprise-wide Defense Service Bus

This paper describes how Department of Defense (DOD) policy groups responsible for net-centricity, interoperability, and transformation can facilitate the creation of a service bus that works for the whole enterprise instead of just within project stovepipes. Modeled after standards bodies like OASIS and open source development groups like The Apache Foundation, the approach defines an enterpri...


A Methodology to Prioritize the Construction Projects of New Railway Infrastructures for Privatization in Railway Networks (Case Study: Iran)

This study aims to develop a novel methodology to prioritize the construction of new railway infrastructures for privatization. The private sector can cooperate to solve the capacity problems of railway networks through the construction of new infrastructure. The purpose of this study is to answer the basic question of whether the capacity problems of the railway networks can be solved simply by ...




Publication date: 2013